
Add an example to the readme? #2067

Merged: 7 commits from mcabbott-patch-2 into master, Oct 8, 2022
Conversation

@mcabbott (Member) commented on Sep 26, 2022:

This proposes adding a very simple Flux model to the readme: almost the simplest thing I could think of that actually works.

It's nice to see code right away. I guess it won't make sense if you've never seen a neural network, but if you've ever tried a demo elsewhere, this should look familiar and point out the main features: (I hope) that batch dimensions are last, that the model is a mutable object, how to move it to the GPU, and what arguments train! needs (and in what order).

Without BatchNorm this works about 90% of the time; with default Adam() too, just 80%. So the post-1969 details are doing something.

On Julia 1.7+, just pasting this in will install everything, including CUDA. Maybe it should say so; I didn't want to put too many words. (Edit: now it does.)
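
For concreteness, here is a sketch of roughly the kind of example under discussion. The data-generation lines (noisy XOR-style inputs, the DataLoader settings, the accuracy check) are illustrative guesses, not necessarily the exact code this PR adds:

using Flux, Statistics  # on Julia 1.7+, `using Flux` can install everything, including CUDA

# Hypothetical toy data: noisy 2D points, labelled by XOR of their coordinates.
# Batch dimension last: a 2×1000 matrix holds 1000 samples.
noisy = rand(Float32, 2, 1000)
truth = [xor(col[1] > 0.5, col[2] > 0.5) for col in eachcol(noisy)]
target = Flux.onehotbatch(truth, [true, false])      # 2×1000 one-hot matrix
data = Flux.DataLoader((noisy, target) |> gpu, batchsize=64, shuffle=true)

model = Chain(Dense(2 => 3, sigmoid), BatchNorm(3), Dense(3 => 2)) |> gpu
optim = Adam(0.1, (0.7, 0.95))
mloss(x, y) = Flux.logitcrossentropy(model(x), y)    # closes over model

for epoch in 1:100                                   # each train! call is one pass over data
    Flux.train!(mloss, Flux.params(model), data, optim)
end

out = model(noisy |> gpu) |> cpu                     # 2×1000 matrix of logits
mean((softmax(out) .> 0.5) .== target)               # accuracy; usually close to 1.0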


README.md Outdated

[![](https://img.shields.io/badge/docs-stable-blue.svg)](https://fluxml.github.io/Flux.jl/stable/) [![](https://img.shields.io/badge/chat-on%20slack-yellow.svg)](https://julialang.org/slack/) [![ColPrac: Contributor's Guide on Collaborative Practices for Community Packages](https://img.shields.io/badge/ColPrac-Contributor's%20Guide-blueviolet)](https://github.com/SciML/ColPrac) [![DOI](https://joss.theoj.org/papers/10.21105/joss.00602/status.svg)](https://doi.org/10.21105/joss.00602)

[![][action-img]][action-url] [![][codecov-img]][codecov-url]
@mcabbott (Member, Author) commented on Sep 26, 2022:

Moved because these badges wrapped onto two lines, and seemed to be in random order. This puts the CI ones on the second line and centres them below the logo.

A Member commented:

I think FluxML projects should also include the "number of downloads" badge in their READMEs -

Flux Downloads
Zygote Downloads
NNlib Downloads
Functors Downloads
Metalhead Downloads
FastAI Downloads
... and so on

@mcabbott (Member, Author) replied:

Could do, 88k sounds like a lot...

README.md Outdated (resolved)

README.md Outdated
Comment on lines 7 to 9
[![](https://img.shields.io/badge/docs-stable-blue.svg)](https://fluxml.github.io/Flux.jl/stable/) [![](https://img.shields.io/badge/chat-on%20slack-yellow.svg)](https://julialang.org/slack/) [![ColPrac: Contributor's Guide on Collaborative Practices for Community Packages](https://img.shields.io/badge/ColPrac-Contributor's%20Guide-blueviolet)](https://github.com/SciML/ColPrac) [![DOI](https://joss.theoj.org/papers/10.21105/joss.00602/status.svg)](https://doi.org/10.21105/joss.00602)
<br/>
[![][action-img]][action-url] [![][codecov-img]][codecov-url]
@Saransh-cpp (Member) commented on Oct 2, 2022:

Also, it might be better to arrange these badges in a table? Something like this:

| Type            | Badge/Status                                                                              |
|-----------------|-------------------------------------------------------------------------------------------|
| CI              | pkgeval                                                                                    |
| Documentation   | pages-build-deployment                                                                     |
| Community       | Flux Downloads, ColPrac: Contributor's Guide on Collaborative Practices for Community Packages, deps |
| DOI and version | DOI, version                                                                               |

But this would take more vertical space

@mcabbott (Member, Author) replied:

This is more logically organised. But I'm not sure these badges are really so valuable. Knowing to click the blue button is a quick way to get to the docs on an unknown package, but perhaps we should trim some of the others.

@CarloLucibello (Member) commented:

The example feels too smart, but on the other hand it could be catchier than a simpler MNIST example.

@mcabbott (Member, Author) commented on Oct 7, 2022:

I wondered about MNIST, but I think it'll end up a fair bit longer... with more of the code dealing with loading data etc., rather than core Flux + basic Julia stuff?

Could do linear regression pretty compactly (see the sketch below), but that seems less illustrative.
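
For comparison, a sketch of how compact that linear-regression version might be (illustrative only, not code from this PR; the data and hyper-parameters are made up):

using Flux

x = rand(Float32, 1, 100)                        # 100 samples, batch dimension last
y = 2 .* x .+ 1 .+ 0.1f0 .* randn(Float32, 1, 100)
model = Dense(1 => 1)                            # a single affine map
loss(x, y) = Flux.mse(model(x), y)               # again closes over model
Flux.train!(loss, Flux.params(model), Iterators.repeated((x, y), 200), Descent(0.1))
model.weight, model.bias                         # should approach ([2.0], [1.0])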

I do think it should be short. Having a long, nicely explained example means it starts competing with the docs as an alternative way to learn, and we already have too many paths. Just a minimal thing you can copy-paste on day 1 & see that everything is installed and working.

This is a slightly weird feature of our API... and since we also call logitcrossentropy a loss function, perhaps we should emphasize that loss(x, y) is a new thing defined just for this model, not a function for all time.
Comment on lines +28 to +32
model = Chain(Dense(2 => 3, sigmoid), BatchNorm(3), Dense(3 => 2)) |> gpu
optim = Adam(0.1, (0.7, 0.95))
mloss(x, y) = Flux.logitcrossentropy(model(x), y) # closes over model

Flux.train!(mloss, Flux.params(model), data, optim) # updates model & optim
@mcabbott (Member, Author) commented on Oct 7, 2022:

BTW, this business of defining a loss which closes over the model (unlike what Flux.Losses calls a loss) was pretty confusing to me when I first tried Flux.

train! needs two different objects with implicit references to the model (and optim ends up with them), but does not seem to see the model itself? That's pretty weird.

IMO we should define crossentropy(m, x, y) = crossentropy(m(x), y) globally, and make Flux 0.14 do this, which is much less confusing:

model = Chain(Dense(2 => 3, sigmoid), BatchNorm(3), Dense(3 => 2)) |> gpu
optim = Flux.setup(Adam(0.1, (0.7, 0.95)), model)  # stores Adam's momenta

Flux.train!(Flux.logitcrossentropy, model, data, optim)  # updates model & optim
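
For clarity, the proposal amounts to giving each loss a model-first method, so train! can be handed the loss and the model separately. A sketch, assuming the model, data and optim from the snippet above (the helper name is made up; the explicit-style train! that shipped in Flux 0.14 does call the loss as loss(model, x, y)):

# Hypothetical model-first loss, as proposed above:
three_arg_loss(m, x, y) = Flux.logitcrossentropy(m(x), y)

# Then, with optim = Flux.setup(...) as above:
Flux.train!(three_arg_loss, model, data, optim)   # calls three_arg_loss(model, x, y)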

A Member replied:

This seems like the way forward to move train! away from Params.

@mcabbott merged commit dfc5a7e into master on Oct 8, 2022.
@mcabbott deleted the mcabbott-patch-2 branch on October 8, 2022 at 11:36.